Bilinear Attention Networks
Kim, Jin-Hwa, Jun, Jaehyun, Zhang, Byoung-Tak
Attention networks in multimodal learning provide an efficient way to utilize given visual information selectively. However, the computational cost of learning attention distributions for every pair of multimodal input channels is prohibitively expensive. To work around this problem, co-attention builds two separate attention distributions, one per modality, neglecting the interaction between multimodal inputs. In this paper, we propose bilinear attention networks (BAN) that find bilinear attention distributions to utilize given vision-language information seamlessly. BAN considers bilinear interactions among two groups of input channels, while low-rank bilinear pooling extracts the joint representations for each pair of channels. Furthermore, we propose a variant of multimodal residual networks to exploit the eight attention maps of BAN efficiently. We quantitatively and qualitatively evaluate our model on the visual question answering (VQA 2.0) and Flickr30k Entities datasets, showing that BAN significantly outperforms previous methods and achieves new state-of-the-art results on both.
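The low-rank bilinear attention the abstract describes can be sketched in a few lines of NumPy. This is a toy illustration with made-up dimensions: `U`, `V`, and `p` stand in for the learned low-rank projections and pooling vector, and ReLU is assumed as the nonlinearity; the paper's full model adds glimpses, residual connections, and learned classifiers on top.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

# Toy sizes: N text channels, M visual channels, low-rank joint dim K.
N, M = 4, 6
dx, dy, K = 8, 10, 16

X = rng.standard_normal((N, dx))   # e.g., question word features
Y = rng.standard_normal((M, dy))   # e.g., image region features
U = rng.standard_normal((dx, K))   # low-rank projection for X (assumed learned)
V = rng.standard_normal((dy, K))   # low-rank projection for Y (assumed learned)
p = rng.standard_normal(K)         # pooling vector (assumed learned)

# Bilinear attention logits for every pair (i, j):
#   logits[i, j] = p^T (ReLU(U^T x_i) * ReLU(V^T y_j))
Xu = np.maximum(X @ U, 0.0)        # (N, K)
Yv = np.maximum(Y @ V, 0.0)        # (M, K)
logits = (Xu * p) @ Yv.T           # (N, M)

# One attention map: a distribution over all N*M channel pairs.
A = softmax(logits.ravel()).reshape(N, M)

# Low-rank bilinear pooling under the attention map gives the joint feature:
#   f[k] = sum_ij A[i, j] * Xu[i, k] * Yv[j, k]
f = np.einsum('ij,ik,jk->k', A, Xu, Yv)  # (K,)
```

The key efficiency point is visible in the shapes: the pairwise interaction is never materialized as an N×M×K tensor; two K-dimensional projections and one matrix product suffice.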
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering
Zhang, Zhilin, Wang, Jie, Zhu, Ruiqi, Gong, Xiaoliang
Medical Visual Question Answering (MedVQA) has gained increasing attention at the intersection of computer vision and natural language processing. Its capability to interpret radiological images and deliver precise answers to clinical inquiries positions MedVQA as a valuable tool for supporting physicians' diagnostic decision-making and alleviating the workload on radiologists. While recent approaches focus on unified pre-trained large models for multi-modal fusion, such as cross-modal Transformers, research on more efficient fusion methods remains relatively scarce in this discipline. In this paper, we introduce a novel fusion model that integrates Orthogonality loss, Multi-head attention and Bilinear Attention Network (OMniBAN) to achieve high computational efficiency and strong performance without pre-training. We conduct comprehensive experiments and clarify how bilinear attention fusion can be enhanced to approach the performance of large models. Experimental results show that OMniBAN outperforms traditional models on key MedVQA benchmarks while maintaining a lower computational cost, indicating its potential for efficient clinical application in radiology and pathology image question answering.
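Of the three ingredients named in the abstract, the orthogonality loss is the least standard. A common formulation, assumed here since the paper's exact definition is not given in the abstract, penalizes the squared Frobenius norm of the cross-correlation between two feature matrices, driving the two subspaces apart:

```python
import numpy as np

def orthogonality_loss(A, B):
    """Squared Frobenius norm of A^T B (rows = samples, columns = features).

    Zero when every feature direction of A is orthogonal to every
    feature direction of B. This is one common formulation; OMniBAN's
    exact loss may differ.
    """
    return float(np.linalg.norm(A.T @ B, ord='fro') ** 2)

# Toy example: features living in disjoint directions incur zero penalty,
# while identical features incur a positive one.
A = np.array([[1.0, 0.0], [0.0, 0.0]])
B = np.array([[0.0, 0.0], [0.0, 1.0]])
zero_loss = orthogonality_loss(A, B)       # 0.0: subspaces are orthogonal
pos_loss = orthogonality_loss(A, A)        # > 0: fully overlapping
```

In a fusion model this penalty is typically added to the task loss with a small weight, encouraging, for example, modality-specific and shared representations to carry complementary information.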
AI could speed up discovery of new medicines
Artificial intelligence that could reduce the cost and speed up the discovery of new medicines has been developed as part of a collaboration between researchers at the University of Sheffield and AstraZeneca. The new technology, developed by Professor Haiping Lu and his Ph.D. student Peizhen Bai from Sheffield's Department of Computer Science, with Dr. Filip Miljković and Dr. Bino John from AstraZeneca, is described in a new study published in Nature Machine Intelligence. The study demonstrates that the AI, called DrugBAN, can predict whether a candidate drug will interact with its intended target protein molecules inside the human body. AI that can predict whether drugs will reach their intended targets already exists, but the technology developed by the researchers at Sheffield and AstraZeneca can do this with greater accuracy and also provides useful insights to help scientists understand how drugs engage with their protein partners at a molecular level, according to the paper published on February 2, 2023. AI has the potential to inform whether a drug will successfully engage an intended cancer-related protein, or whether a candidate drug will bind to unintended targets in the body and lead to undesirable side effects for patients.